Stabilizing Value Iteration with and without Approximation Errors

نویسنده

Ali Heydari

چکیده

Adaptive optimal control using value iteration (VI) initiated from a stabilizing policy is theoretically analyzed in various aspects including the continuity of the result, the stability of the system operated using any single/constant resulting control policy, the stability of the system operated using the evolving/timevarying control policy, the convergence of the algorithm, and the optimality of the limit function. Afterwards, the effect of presence of approximation errors in the involved function approximation processes is incorporated and another set of results for boundedness of the approximate VI as well as stability of the system operated under the results for both cases of applying a single policy or an evolving policy are derived. A feature of the presented results is providing estimations of the region of attraction so that if the initial condition is within the region, the whole trajectory will remain inside it and hence, the function approximation results will be reliable.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stability Analysis of Optimal Adaptive Control using Value Iteration with Approximation Errors

Adaptive optimal control using value iteration initiated from a stabilizing control policy is theoretically analyzed in terms of stability of the system during the learning stage without ignoring the effects of approximation errors. This analysis includes the system operated using any single/constant resulting control policy and also using an evolving/time-varying control policy. A feature of t...

متن کامل

Implicit iteration approximation for a‎ ‎finite family of asymptotically quasi-pseudocontractive type‎ ‎mappings

In this paper‎, ‎strong convergence theorems of Ishikawa type implicit iteration‎ ‎process with errors for a finite family of asymptotically‎ ‎nonexpansive in the intermediate sense and asymptotically‎ ‎quasi-pseudocontractive type mappings in normed linear spaces are‎ ‎established by using a new analytical method‎, ‎which essentially‎ ‎improve and extend some recent results obtained by Yang‎ ‎...

متن کامل

Dhage iteration method for PBVPs of nonlinear first order hybrid integro-differential equations

In this paper, author proves the algorithms for the existence as well as the approximation of solutions to a couple of periodic boundary value problems of nonlinear first order ordinary integro-differential equations using operator theoretic techniques in a partially ordered metric space. The main results rely on the Dhage iteration method embodied in the recent hybrid fixed point theorems of D...

متن کامل

Verification and Validation of Common Derivative Terms Approximation in Meshfree Numerical Scheme

In order to improve the approximation of spatial derivatives without meshes, a set of meshfree numerical schemes for derivative terms is developed, which is compatible with the coordinates of Cartesian, cylindrical, and spherical. Based on the comparisons between numerical and theoretical solutions, errors and convergences are assessed by a posteriori method, which shows that the approximations...

متن کامل

Theoretical and Numerical Analysis of Approximate Dynamic Programming with Approximation Errors

This study is aimed at answering the famous question of how the approximation errors at each iteration of Approximate Dynamic Programming (ADP) affect the quality of the final results considering the fact that errors at each iteration affect the next iteration. To this goal, convergence of Value Iteration scheme of ADP for deterministic nonlinear optimal control problems with undiscounted cost ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1412.5675 شماره

صفحات -

تاریخ انتشار 2014

Stabilizing Value Iteration with and without Approximation Errors

نویسنده

چکیده

منابع مشابه

Stability Analysis of Optimal Adaptive Control using Value Iteration with Approximation Errors

Implicit iteration approximation for a‎ ‎finite family of asymptotically quasi-pseudocontractive type‎ ‎mappings

Dhage iteration method for PBVPs of nonlinear first order hybrid integro-differential equations

Verification and Validation of Common Derivative Terms Approximation in Meshfree Numerical Scheme

Theoretical and Numerical Analysis of Approximate Dynamic Programming with Approximation Errors

عنوان ژورنال:

اشتراک گذاری